SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: HUMAN GENOME SCIENCES, INC. 



9410 KEY WEST AVENUE 
ROCKVILLE, MD 20850 
UNITED STATES OF AMERICA 



APPLICANTS/INVENTORS: LI, YI

CAO, LIANG 
ROSEN, CRAIG A. 

(ii) TITLE OF INVENTION: Human G-Protein Coupled Receptor

(iii) NUMBER OF SEQUENCES: 30

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX P.L.L.C. 

(B) STREET: 1100 NEW YORK AVE., NW, SUITE 600 

(C) CITY: WASHINGTON 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: FLOPPY DISK 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PATENTIN RELEASE #1.0, VERSION #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US95/04079 

(B) FILING DATE: 30-MAR-1995 

(C) CLASSIFICATION: 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/765,973 

(B) FILING DATE: 06-JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: STEFFE, ERIC K. 

(B) REGISTRATION NUMBER: 36,688 

(C) REFERENCE/DOCKET NUMBER: 1488.114PC0O/EKS/KLM

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 116..1003



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GGCACGAGGT CATTCAACAT TTATTCAACC AAAAATACTA AGTCAGCTCT ATACAAACTA 60

ATGGAAGGAT ACAGCTATGC AAATATAGAA CACTAAAGTG TTACATGACA GATGT ATG 118 

Met 
1 

AGT AGT GAA ATG GTG AAA AAT CAG ACA ATG GTC ACA GAG TTC CTC CTA 166
Ser Ser Glu Met Val Lys Asn Gln Thr Met Val Thr Glu Phe Leu Leu

5 10 15 

CTG GGA TTT CTC CTG GGC CCA AGG ATT CAG ATG CTC CTC TTT GGG CTC 214 
Leu Gly Phe Leu Leu Gly Pro Arg Ile Gln Met Leu Leu Phe Gly Leu
20 25 30 

TTC TCC CTG TTC TAT GTC TTC ACC CTG CTG GGG AAT GGG ACC ATC CTG 262
Phe Ser Leu Phe Tyr Val Phe Thr Leu Leu Gly Asn Gly Thr Ile Leu
35 40 45 

GGG CTC ATC TCA CTG GAC TCC AGA CTC CAC ACC CCC ATG TAC TTC TTC 310 
Gly Leu Ile Ser Leu Asp Ser Arg Leu His Thr Pro Met Tyr Phe Phe
50 55 60 65 

CTC TCA CAC CTG GCC GTC GTC AAC ATC GCC TAT GCC TGC AAC ACA GTG 358 
Leu Ser His Leu Ala Val Val Asn Ile Ala Tyr Ala Cys Asn Thr Val

70 75 80 

CCC CAG ATG CTG GTG AAC CTC CTG CAT CCA GCC AAG CCC ATC TCC TTT 406 
Pro Gln Met Leu Val Asn Leu Leu His Pro Ala Lys Pro Ile Ser Phe

85 90 95 

GCT GGT TGC ATG ACA CTA GAC TTT CTC TTT TTG AGT TTT GCA CAT ACT 454

Ala Gly Cys Met Thr Leu Asp Phe Leu Phe Leu Ser Phe Ala His Thr 
100 105 110 

GAA TGC CTC CTG TTG GTG CTG ATG TCC TAC GAT CGG TAC GTG GCC ATC 502
Glu Cys Leu Leu Leu Val Leu Met Ser Tyr Asp Arg Tyr Val Ala Ile
115 120 125 

TGC CAC CCT CTC CGA TAT TTC ATC ATC ATG ACC TGG AAA GTC TGC ATC 550 
Cys His Pro Leu Arg Tyr Phe Ile Ile Met Thr Trp Lys Val Cys Ile
130 135 140 145 

ACT CTG GGC ATC ACT TCC TGG ACA TGT GGC TCC CTC CTG GCT ATG GTC 598 
Thr Leu Gly Ile Thr Ser Trp Thr Cys Gly Ser Leu Leu Ala Met Val

150 155 160 

CAT GTG AGC CTC ATC CTA AGA CTG CCC TTT TGT GGG CCT CGT GAA ATC 646
His Val Ser Leu Ile Leu Arg Leu Pro Phe Cys Gly Pro Arg Glu Ile

165 170 175 

AAC CAC TTC TTC TGT GAA ATC CTG TCT GTC CTC AGG CTG GCC TGT GCT 694 
Asn His Phe Phe Cys Glu Ile Leu Ser Val Leu Arg Leu Ala Cys Ala
180 185 190 



GAT ACC TGG CTC AAC CAG GTG GTC ATC TTT GAA GCC TGC ATG TTC ATC 742
Asp Thr Trp Leu Asn Gln Val Val Ile Phe Glu Ala Cys Met Phe Ile
195 200 205

CTG GTG GGA CCA CTC TGC CTG GTG CTG GTC TCC TAC TCA CAC ATC CTG 790 
Leu Val Gly Pro Leu Cys Leu Val Leu Val Ser Tyr Ser His Ile Leu
210 215 220 225 

GGG GGC ATC CTG AGG ATC CAG TCT GGG GAG GGC CGC AGA AAG GCC TTC 838 
Gly Gly Ile Leu Arg Ile Gln Ser Gly Glu Gly Arg Arg Lys Ala Phe

230 235 240 

TCC ACC TGC TCC TCC CAC CTC TGC GTA GTG GGA CTC TTC TTT GGS AGC 886 
Ser Thr Cys Ser Ser His Leu Cys Val Val Gly Leu Phe Phe Gly Ser 

245 250 255 

GCC ATC GTC ATG TAC ATG GCC CCT AAG TCC CGC CAT CCT GAG GAG CAG 934 
Ala Ile Val Met Tyr Met Ala Pro Lys Ser Arg His Pro Glu Glu Gln
260 265 270 

CAG AAG GTC CTT TTT CTT ATT TTA CAG TTC CTT TCA ACC CCG ATG CTT 982 
Gln Lys Val Leu Phe Leu Ile Leu Gln Phe Leu Ser Thr Pro Met Leu
275 280 285 

AAA CCC CCT GAT TTA CAA CCC TGA GGAATGTAGA GGGTCAAGGG TGCCCTCCGA 1036 
Lys Pro Pro Asp Leu Gln Pro
290 295 

GGAGACCACT GTGCAARGRA AGTCATTCCT AAGGGGTGTG ACATTTGAAC TGCCAGCCCC 1096

AGTTGCCCCG TGGACTCCTG ATGCCCAATT ATTGCCTCAA CCCAGAAAAG TTTACTCCCC 1156 

TTTAACTGTG CTTTACTGAC AGAAGGGCAA GCCTTCTCCC GTTTTTTGCA GATAAAATTT 1216 

TAGATGTGTT GCAATCATTG GGTTTCTAGG AGATGTGGTT TTATCAGACA ATTTTTTCTT 1276 

TTATTTCACA ATTACTTTAA TATCTGTAAA ATAAAGAATT ATTTTAAATC ATTTTCCCAG 1336 

TCCCAAAAGT TAAATACAGG CCACTTACTT CTTTAACCAA ATGATATAGT TTGGCTCTGT 1396 

GTCCCCACCC AAATCTCATG TCAAATTGTA ATCCCCGCAT GTCAGCGGAG GGACCTGGTG 1456 

GGAGGTGATT GGATCATGGG GAGGGATTTC CCCCTTGCTG TTCTGTTGAT AGTGAACGAG 1516 

TTCTCACGAA ATCTGATGGT TTAAAAGTGC AGCACTTCTC CCTTTGCTCT CTCTCTCCTG 1576 

CTGTGCCATG GTAAGACGTG CCTTGCTTCC CCTGGTGCTT CCGCCATGAT TGTACCTTTC 1636

CTGAGGCCTC TCCAGCCATG TGGAACTGTG AGCCAATTAA ACTTCTTTTC TTTAGAAAAA 1696 

AAAAAAAAAA AAAAAAA 1713 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ser Ser Glu Met Val Lys Asn Gln Thr Met Val Thr Glu Phe Leu
1 5 10 15

Leu Leu Gly Phe Leu Leu Gly Pro Arg Ile Gln Met Leu Leu Phe Gly

20 25 30 

Leu Phe Ser Leu Phe Tyr Val Phe Thr Leu Leu Gly Asn Gly Thr Ile
35 40 45 

Leu Gly Leu Ile Ser Leu Asp Ser Arg Leu His Thr Pro Met Tyr Phe
50 55 60 

Phe Leu Ser His Leu Ala Val Val Asn Ile Ala Tyr Ala Cys Asn Thr
65 70 75 80 

Val Pro Gln Met Leu Val Asn Leu Leu His Pro Ala Lys Pro Ile Ser

85 90 95 

Phe Ala Gly Cys Met Thr Leu Asp Phe Leu Phe Leu Ser Phe Ala His 

100 105 110 

Thr Glu Cys Leu Leu Leu Val Leu Met Ser Tyr Asp Arg Tyr Val Ala 
115 120 125 

Ile Cys His Pro Leu Arg Tyr Phe Ile Ile Met Thr Trp Lys Val Cys
130 135 140 

Ile Thr Leu Gly Ile Thr Ser Trp Thr Cys Gly Ser Leu Leu Ala Met
145 150 155 160 

Val His Val Ser Leu Ile Leu Arg Leu Pro Phe Cys Gly Pro Arg Glu

165 170 175 

Ile Asn His Phe Phe Cys Glu Ile Leu Ser Val Leu Arg Leu Ala Cys

180 185 190 

Ala Asp Thr Trp Leu Asn Gln Val Val Ile Phe Glu Ala Cys Met Phe
195 200 205 

Ile Leu Val Gly Pro Leu Cys Leu Val Leu Val Ser Tyr Ser His Ile
210 215 220 

Leu Gly Gly Ile Leu Arg Ile Gln Ser Gly Glu Gly Arg Arg Lys Ala
225 230 235 240 

Phe Ser Thr Cys Ser Ser His Leu Cys Val Val Gly Leu Phe Phe Gly 

245 250 255 

Ser Ala Ile Val Met Tyr Met Ala Pro Lys Ser Arg His Pro Glu Glu

260 265 270 

Gln Gln Lys Val Leu Phe Leu Ile Leu Gln Phe Leu Ser Thr Pro Met
275 280 285 

Leu Lys Pro Pro Asp Leu Gln Pro
290 295 

(2) INFORMATION FOR SEQ ID NO: 3:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2185 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 884..2062



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



TCACTATAGG GCGAATTGGG TACGGGCCCC CCCTCGAGGT CGACGGTATC GATAAGCTTG 60 

ATATCGAATT CGGCACGAGC CGGGCTCGGA GAGGTGACGG AACCGGGGCT GGTAGCATAG 120

TTTGATTTGA TGATGGAGCC AACACAGGGG TTGGAGCTGG TACCGGTGAA GCTGAGGCTA 180 

AAAAGGTTCC TGGAGTAGAC GATGGAGCCA TAACTGGAAC CGGAGTCTGT GAATGAAGCC 240

AGGACAGGAG CAGCACCTGG CGATGGTGCC AGGACCGGAA GAGGAGCCAG AGGAGGAGCT 300

GGAGAAGGAG CCAGAATTGC TGTCTGTGGA GCCGCCATAG GAGCCAGAGG GGTGGCTAGA 360 

GCCTGAGAAT GCAGAAGATG CTGGAGCCAG AAGGGAAGCC TGAGCTGGAG CTGGATTTGG 420

TGCTGACGGA AAAGGACTGG CCAGAGCCGA AGCTGGCACC AGGGACAGGT GAGCATTCTG 480

GGGCCACGGT TGAGTTCAAC CCACTGACTT CAGGTGAAGG ACTGTGGACC AGCTTGAGAA 540 

GAGGCCTCAC CAGAGTGGGT GTGGGGCATG GGGGCTCGAG CAGTACCCAG AGTAGGTGTG 600 

GGTAGCCCGG CCAGGGGTTA ACGTGGGGCG TGGATTCAAC ACAGCTTGGA AGCCCAGAGC 660

TCGGAGGCCC GGGTGCTTGG GCCAATTGAG GAACAGGAGT CAGTCCATCC CGAGGGGGTT 720

GTCTCACTAC AATCTTCACA CGCCTTTATT ATTCACCATG GTTGGTGGCA CCTGGTTAGC 780 

AGCAAGCGGA AGGCTGAGGC CAGTAGGGGC AGGGGTGTTA CTGGGGGTCG AAGAAGCCAG 840

CACAGAGACA GGGGTAGGGC CAGGGGTCGG GGCCACGGCC TGG ATG AGG CCC ACA 895 

Met Arg Pro Thr 
1 

TGG GCA GGC TGG CTG ATG AGA TGG TGC TGC CCC CCT GCT GAC ACG AGG 943 
Trp Ala Gly Trp Leu Met Arg Trp Cys Cys Pro Pro Ala Asp Thr Arg 
5 10 15 20

TGC ACC ACA TTC CTT TGC AGC GGG CGG GCT GCC CCA CAG CAA GCT GGC 991 
Cys Thr Thr Phe Leu Cys Ser Gly Arg Ala Ala Pro Gln Gln Ala Gly

25 30 35 

GCA CCT GGG CAC CAT CCA AAA TAC AGC TTG TTT CCC TGG ATT TGG AAG 1039
Ala Pro Gly His His Pro Lys Tyr Ser Leu Phe Pro Trp Ile Trp Lys

40 45 50 

GTG AGA GGT TTG CTT CCC CCT CCA TTA ACC ACT GAC GTT GTG CCA GTG 1087 
Val Arg Gly Leu Leu Pro Pro Pro Leu Thr Thr Asp Val Val Pro Val 
55 60 65 

AGA CTA ACT CTC CGC GCC AAT CTG TCC GCG GCT GAC CTC CTT CGC GGG 1135
Arg Leu Thr Leu Arg Ala Asn Leu Ser Ala Ala Asp Leu Leu Arg Gly
70 75 80 

CGT GGC CTA CCT CTT CCT CAT GTT CCA CAC TGT CCC CGC ACA GCC CGA 1183 
Arg Gly Leu Pro Leu Pro His Val Pro His Cys Pro Arg Thr Ala Arg 
85 90 95 100 

CTT TCA CTT GAG GGC TGG TTC CTG CGG CAG GGC TTG CTG GAC ACA AAC 1231
Leu Ser Leu Glu Gly Trp Phe Leu Arg Gln Gly Leu Leu Asp Thr Asn

105 110 115 

CTC ACT GCG TCG GTG GCC ACA CTG CTG GCC ATC GCC GTG GAG CGG CAC 1279
Leu Thr Ala Ser Val Ala Thr Leu Leu Ala Ile Ala Val Glu Arg His

120 125 130 

CGC AGT GTG ATG GCC GTG CAG CTG CAC AGC CGC CTG CCC CGT GGC CGC 1327
Arg Ser Val Met Ala Val Gln Leu His Ser Arg Leu Pro Arg Gly Arg
135 140 145

GTG GTC ATG CTC ATT GTG GGC GTG TGG GTG GCT GCC CTG GGC CTG GGG 1375 
Val Val Met Leu Ile Val Gly Val Trp Val Ala Ala Leu Gly Leu Gly
150 155 160 

CTG CTG CCT GCC CAC TCC TGG CAC TGC CTC TGT GCC CTG GAC CGC TCC 1423 
Leu Leu Pro Ala His Ser Trp His Cys Leu Cys Ala Leu Asp Arg Ser 
165 170 175 180 

TCA CGC ATG GCA CCC CTG CTC AGC CGC TCC TAT TTG GCC GTC TGG GCT 1471

Ser Arg Met Ala Pro Leu Leu Ser Arg Ser Tyr Leu Ala Val Trp Ala 

185 190 195 

CTG TCG AGC CTG CTT GTC TTC CTG CTC ATG GTG GCT GTG TAC ACC CGC 1519 
Leu Ser Ser Leu Leu Val Phe Leu Leu Met Val Ala Val Tyr Thr Arg 

200 205 210 

ATT TTC TTC TAC GTG CGG CGG CGA GTG CAG CGC ATG GCA GAG CAT GTC 1567 
Ile Phe Phe Tyr Val Arg Arg Arg Val Gln Arg Met Ala Glu His Val
215 220 225 

AGC TGC CAC CCC CGC TAC CGA GAG ACC ACG CTC AGC CTG GTC AAG ACT 1615 
Ser Cys His Pro Arg Tyr Arg Glu Thr Thr Leu Ser Leu Val Lys Thr 
230 235 240 

GTT GTC ATC ATC CTG GGG GCG TTC GTG GTC TGC TGG ACA CCA GGC CAG 1663 
Val Val Ile Ile Leu Gly Ala Phe Val Val Cys Trp Thr Pro Gly Gln
245 250 255 260 

GTG GTA CTG CTC CTG GAT GGT TTA GGC TGT GAG TCC TGC AAT GTC CTG 1711 
Val Val Leu Leu Leu Asp Gly Leu Gly Cys Glu Ser Cys Asn Val Leu 

265 270 275 

GCG TTA GAA AAG TAC TTC CTA CTG TTG GCC GAG CCA ACC TCA CTG GTC 1759 
Ala Leu Glu Lys Tyr Phe Leu Leu Leu Ala Glu Pro Thr Ser Leu Val 

280 285 290 

AAT GCT GCT GTG TAC TCT TGC CGA GAT GCT GAG ATG CGC CGC ACC TTC 1807 
Asn Ala Ala Val Tyr Ser Cys Arg Asp Ala Glu Met Arg Arg Thr Phe 
295 300 305 

CGC CGC CTT CTC CTG CTG CGC GTG CCT CCG CCA GTC CAC CCG CGA GTC 1855 
Arg Arg Leu Leu Leu Leu Arg Val Pro Pro Pro Val His Pro Arg Val 
310 315 320

TGT CCA CTA TAC ATC CTC TGC CCA GGG AGG TGC CAG CAC TCG CAT CAT 1903
Cys Pro Leu Tyr Ile Leu Cys Pro Gly Arg Cys Gln His Ser His His
325 330 335 340

GCT TCC CGA GAA CGG CCA CCC ACT GAT GGA CTC CAC CCT TTA GCT ACC 1951 
Ala Ser Arg Glu Arg Pro Pro Thr Asp Gly Leu His Pro Leu Ala Thr 

345 350 355 

TTG AAC TAC AGC GGT ACG CGG CAA GCA ACA AAT CCA CAG CCC CTG ATG 1999 
Leu Asn Tyr Ser Gly Thr Arg Gln Ala Thr Asn Pro Gln Pro Leu Met

360 365 370 

ACT TGT GGG TGC TCC TGG CTC AAC CCA ACC TCG TGC CGA ATT CCT GCA 2047
Thr Cys Gly Cys Ser Trp Leu Asn Pro Thr Ser Cys Arg Ile Pro Ala
375 380 385 

GCC CGG GGG ATC CAC TAG TTCTAGAGCG GCGCCACCGC GGTGGAGCTC 2095
Ala Arg Gly Ile His
390 

CAGCTTTTGT TCCCTTTAGT GAGGGTTAAT TTCGAGCTTG GCGTAATCAT GGTCATAGCT 2155 
GTTTCCTGTG TGAAATTGTT ATCCGCTCAC 2185 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Arg Pro Thr Trp Ala Gly Trp Leu Met Arg Trp Cys Cys Pro Pro
1 5 10 15

Ala Asp Thr Arg Cys Thr Thr Phe Leu Cys Ser Gly Arg Ala Ala Pro 

20 25 30 

Gln Gln Ala Gly Ala Pro Gly His His Pro Lys Tyr Ser Leu Phe Pro
35 40 45 

Trp Ile Trp Lys Val Arg Gly Leu Leu Pro Pro Pro Leu Thr Thr Asp
50 55 60 

Val Val Pro Val Arg Leu Thr Leu Arg Ala Asn Leu Ser Ala Ala Asp 
65 70 75 80 

Leu Leu Arg Gly Arg Gly Leu Pro Leu Pro His Val Pro His Cys Pro 

85 90 95 

Arg Thr Ala Arg Leu Ser Leu Glu Gly Trp Phe Leu Arg Gln Gly Leu

100 105 110 

Leu Asp Thr Asn Leu Thr Ala Ser Val Ala Thr Leu Leu Ala Ile Ala
115 120 125 



Val Glu Arg His Arg Ser Val Met Ala Val Gln Leu His Ser Arg Leu
130 135 140

Pro Arg Gly Arg Val Val Met Leu Ile Val Gly Val Trp Val Ala Ala
145 150 155 160

Leu Gly Leu Gly Leu Leu Pro Ala His Ser Trp His Cys Leu Cys Ala
165 170 175

Leu Asp Arg Ser Ser Arg Met Ala Pro Leu Leu Ser Arg Ser Tyr Leu
180 185 190

Ala Val Trp Ala Leu Ser Ser Leu Leu Val Phe Leu Leu Met Val Ala
195 200 205

Val Tyr Thr Arg Ile Phe Phe Tyr Val Arg Arg Arg Val Gln Arg Met
210 215 220

Ala Glu His Val Ser Cys His Pro Arg Tyr Arg Glu Thr Thr Leu Ser
225 230 235 240

Leu Val Lys Thr Val Val Ile Ile Leu Gly Ala Phe Val Val Cys Trp
245 250 255

Thr Pro Gly Gln Val Val Leu Leu Leu Asp Gly Leu Gly Cys Glu Ser
260 265 270

Cys Asn Val Leu Ala Leu Glu Lys Tyr Phe Leu Leu Leu Ala Glu Pro
275 280 285

Thr Ser Leu Val Asn Ala Ala Val Tyr Ser Cys Arg Asp Ala Glu Met
290 295 300

Arg Arg Thr Phe Arg Arg Leu Leu Leu Leu Arg Val Pro Pro Pro Val
305 310 315 320

His Pro Arg Val Cys Pro Leu Tyr Ile Leu Cys Pro Gly Arg Cys Gln
325 330 335

His Ser His His Ala Ser Arg Glu Arg Pro Pro Thr Asp Gly Leu His
340 345 350

Pro Leu Ala Thr Leu Asn Tyr Ser Gly Thr Arg Gln Ala Thr Asn Pro
355 360 365

Gln Pro Leu Met Thr Cys Gly Cys Ser Trp Leu Asn Pro Thr Ser Cys
370 375 380

Arg Ile Pro Ala Ala Arg Gly Ile His
385 390



(2) INFORMATION FOR SEQ ID NO: 5:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 62..940

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CGGCACGAGC ATAAGAAGAC AGAGAGAACT GAGTATCCTC CCAAAGGTGA CACTGGAAGC 60 

A ATG AAC ACC ACA GTA ATG CAA GGC TTC AAC AGA TCT AAG CGG TGC 106
Met Asn Thr Thr Val Met Gln Gly Phe Asn Arg Ser Lys Arg Cys
1 5 10 15

CCC AAA GAC ACT CGG ATA GTA CAG CTG GTA TTC CCA GCC CTC TAC ACA 154 
Pro Lys Asp Thr Arg Ile Val Gln Leu Val Phe Pro Ala Leu Tyr Thr

20 25 30 

GTG GTT TTC TTG ACC GGA ATC CTG CTG AAT ACT TTG GCT CTG TGG GTG 202 
Val Val Phe Leu Thr Gly Ile Leu Leu Asn Thr Leu Ala Leu Trp Val

35 40 45 

TTT GTT CAC ATC CCC AGC TCC TCC ACC TTC ATC ATC TAC CTC AAA AAC 250 
Phe Val His Ile Pro Ser Ser Ser Thr Phe Ile Ile Tyr Leu Lys Asn
50 55 60 

ACT TTG GTG GCC GAC TTG ATA ATG ACA CTC ATG CTT CCT TTC AAA ATC 298 
Thr Leu Val Ala Asp Leu Ile Met Thr Leu Met Leu Pro Phe Lys Ile
65 70 75 

CTC TCT GAC TCA CAC CTG GCA CCC TGG CAG CTC AGA GCT TTT GTG TGT 346
Leu Ser Asp Ser His Leu Ala Pro Trp Gln Leu Arg Ala Phe Val Cys
80 85 90 95 

CGT TTT TCT TCG GTG ATA TTT TAT GAG ACC ATG TAT GTG GGC ATC GTG 394
Arg Phe Ser Ser Val Ile Phe Tyr Glu Thr Met Tyr Val Gly Ile Val

100 105 110 

CTG TTA GGG CTC ATA GCC TTT GAC AGA TTC CTC AAG ATC ATC AGA CCT 442
Leu Leu Gly Leu Ile Ala Phe Asp Arg Phe Leu Lys Ile Ile Arg Pro

115 120 125 

TTG AGA AAT ATT TTT CTA AAA AAA CCT GTT TGG GGA AAA ACG GTC TCA 490
Leu Arg Asn Ile Phe Leu Lys Lys Pro Val Trp Gly Lys Thr Val Ser
130 135 140 

ATC TTC ATC TGG TTC TTT TGG TTC TTC ATC TCC CTG CCA AAT ATG ATC 538
Ile Phe Ile Trp Phe Phe Trp Phe Phe Ile Ser Leu Pro Asn Met Ile
145 150 155 

TTG AGC AAC AAG GAA GCA ACA CCA TCG TCT GTG AAA AAG TGT GCT TCC 586 
Leu Ser Asn Lys Glu Ala Thr Pro Ser Ser Val Lys Lys Cys Ala Ser 
160 165 170 175 

TTA AAG GGG CCT CTG GGG CTG AAA TGG CAT CAA ATG GTA AAT AAC ATA 634 
Leu Lys Gly Pro Leu Gly Leu Lys Trp His Gln Met Val Asn Asn Ile

180 185 190 

TGC CAG TTT ATT TTC TGG ACT GTT TTT ATC CTA ATG CTT GTG TTT TAT 682 
Cys Gln Phe Ile Phe Trp Thr Val Phe Ile Leu Met Leu Val Phe Tyr

195 200 205 

GTG GTT ATT GCA AAA AAG TAT ATG ATT CTT ATA GAA AGT CCA AAA GTA 730 
Val Val Ile Ala Lys Lys Tyr Met Ile Leu Ile Glu Ser Pro Lys Val
210 215 220 



AGG ACA GAA AAA ACA ACA AAA AGC TGG AAG GCA AAG TAT TTG TTG TCG 778
Arg Thr Glu Lys Thr Thr Lys Ser Trp Lys Ala Lys Tyr Leu Leu Ser
225 230 235 

TGG CTG TCT TCT TTG TGT GTT TTG CTC CAT TTC ATT TCG CCA GAG TTC 826
Trp Leu Ser Ser Leu Cys Val Leu Leu His Phe Ile Ser Pro Glu Phe
240 245 250 255 

CAT ATA CTC ACA GTC AAA CCA ACA ATA AGA CTG ACT GTA GAC TGC AAA 874 
His Ile Leu Thr Val Lys Pro Thr Ile Arg Leu Thr Val Asp Cys Lys

260 265 270 

ATC AAC TGT TTA TTG CTA AAG AAA CAA CTC TCT TTT TGG CAG CAA CTA 922 
Ile Asn Cys Leu Leu Leu Lys Lys Gln Leu Ser Phe Trp Gln Gln Leu

275 280 285 

ACA TTT GTA TGG ATC CCT TAA TATACATATT CTTATGTAAA AAATTCACAG 973 
Thr Phe Val Trp Ile Pro
290 

AAAAGCTACC ATGTATGCAA GGGAGAAAGA CCACAGCATC AAGCCAAGAA AATCATAGCA 1033

GTCAGACAGA CAACATAACC TTAGGCTGAC AACTGTACAT AGGGGTAACT TCTATTTATT 1093

GATGAGACTT CCGTAGATAA TGTGGAAATC CAATTTAACC AAGAAAAAAA GATTGGGGCA 1153 

AATGCTCTCT TACATTTTAT TATCCTGGTG TACAGAAAAG ATTATATAAA ATTTAAATCC 1213 

ACATAGATCT ATTCATAAGC TGAATGAACC ATTACTAAGA GAATGCAACA GGATACAAAT 1273

GGCCACTAGA GGTCATTATT TCTTTCTTTC TTTCTTTTTT TTTTTTTAAT TTCAAGAGCA 1333

TTTCACTTTA ACATTTTGGA AAAGACTAAG GAGAAACGTA TATCCCTACA AACCTCCCCT 1393

CCAAACACCT TCTTACATTC TTTTCCACAA TTCACATAAC ACTACTGCTT TTGTGCCCCT 1453

TAAATGTAGA TTTGTTGGCT G 1474 

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Asn Thr Thr Val Met Gln Gly Phe Asn Arg Ser Lys Arg Cys Pro
1 5 10 15

Lys Asp Thr Arg Ile Val Gln Leu Val Phe Pro Ala Leu Tyr Thr Val

20 25 30 

Val Phe Leu Thr Gly Ile Leu Leu Asn Thr Leu Ala Leu Trp Val Phe
35 40 45 

Val His Ile Pro Ser Ser Ser Thr Phe Ile Ile Tyr Leu Lys Asn Thr
50 55 60 

Leu Val Ala Asp Leu Ile Met Thr Leu Met Leu Pro Phe Lys Ile Leu
65 70 75 80

Ser Asp Ser His Leu Ala Pro Trp Gln Leu Arg Ala Phe Val Cys Arg

85 90 95 

Phe Ser Ser Val Ile Phe Tyr Glu Thr Met Tyr Val Gly Ile Val Leu

100 105 110 

Leu Gly Leu Ile Ala Phe Asp Arg Phe Leu Lys Ile Ile Arg Pro Leu
115 120 125 

Arg Asn Ile Phe Leu Lys Lys Pro Val Trp Gly Lys Thr Val Ser Ile
130 135 140 

Phe Ile Trp Phe Phe Trp Phe Phe Ile Ser Leu Pro Asn Met Ile Leu
145 150 155 160 

Ser Asn Lys Glu Ala Thr Pro Ser Ser Val Lys Lys Cys Ala Ser Leu 

165 170 175 

Lys Gly Pro Leu Gly Leu Lys Trp His Gln Met Val Asn Asn Ile Cys

180 185 190 

Gln Phe Ile Phe Trp Thr Val Phe Ile Leu Met Leu Val Phe Tyr Val
195 200 205 

Val Ile Ala Lys Lys Tyr Met Ile Leu Ile Glu Ser Pro Lys Val Arg
210 215 220 

Thr Glu Lys Thr Thr Lys Ser Trp Lys Ala Lys Tyr Leu Leu Ser Trp 
225 230 235 240 

Leu Ser Ser Leu Cys Val Leu Leu His Phe Ile Ser Pro Glu Phe His

245 250 255 

Ile Leu Thr Val Lys Pro Thr Ile Arg Leu Thr Val Asp Cys Lys Ile

260 265 270 

Asn Cys Leu Leu Leu Lys Lys Gln Leu Ser Phe Trp Gln Gln Leu Thr
275 280 285 

Phe Val Trp Ile Pro
290 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1301 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 161..1192



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

TTTTGGGTAT TTCTGAGAAA AAGGAAATAT TTATAAAACC ATCCAAAGAT CCAGATAATT 60

TGCAAATAAA TTGGAGGTTA TAGAGGTTAT AATCTGAATC CCAAAGGAGA CTGCAGCTGA 120 

TGAAAGTGCT TCCAAACTGA AAATTGGACG TGCCTTTACG ATG GTA AGC GTT AAC 175 

Met Val Ser Val Asn 
1 5 

AGC TCC CAC TGC TTC TAT AAT GAC TCC TTT AAG TAC ACT TTG TAT GGG 223 
Ser Ser His Cys Phe Tyr Asn Asp Ser Phe Lys Tyr Thr Leu Tyr Gly 

10 15 20 

TGC ATG TTC AGC ATG GTG TTT GTG CTT GGG TTA ATA TCC AAT TGT GTT 271 
Cys Met Phe Ser Met Val Phe Val Leu Gly Leu Ile Ser Asn Cys Val

25 30 35 

GCC ATA TAC ATT TTC ATC TGC GTC CTC AAA GTC CGA AAT GAA ACT ACA 319 
Ala Ile Tyr Ile Phe Ile Cys Val Leu Lys Val Arg Asn Glu Thr Thr
40 45 50 

ACT TAC ATG ATT AAC TTG GCA ATG TCA GAC TTG CTT TTT GTT TTT ACT 367
Thr Tyr Met Ile Asn Leu Ala Met Ser Asp Leu Leu Phe Val Phe Thr
55 60 65 

TTA CCC TTC AGG ATT TTT TAC TTC ACA ACA CGG AAT TGG CCA TTT GGA 415 
Leu Pro Phe Arg Ile Phe Tyr Phe Thr Thr Arg Asn Trp Pro Phe Gly
70 75 80 85 

GAT TTA CTT TGT AAG ATT TCT GTG ATG CTG TTT TAT ACC AAC ATG TAC 463 
Asp Leu Leu Cys Lys Ile Ser Val Met Leu Phe Tyr Thr Asn Met Tyr

90 95 100 

GGA AGC ATT CTG TTC TTA ACC TGT ATT AGT GTA GAT CGA TTT CTG GCA 511 
Gly Ser Ile Leu Phe Leu Thr Cys Ile Ser Val Asp Arg Phe Leu Ala

105 110 115 

ATT GTC TAC CCA TTT AAG TCA AAG ACT CTA AGA ACC AAA AGA AAT GCA 559 
Ile Val Tyr Pro Phe Lys Ser Lys Thr Leu Arg Thr Lys Arg Asn Ala
120 125 130

AAG ATT GTT TGC ACT GGC GTG TGG TTA ACT GTG ATC GGA GGA AGT GCA 607 
Lys Ile Val Cys Thr Gly Val Trp Leu Thr Val Ile Gly Gly Ser Ala
135 140 145 

CCC GCC GTT TTT GTT CAG TCT ACC CAC TCT CAG GGT AAC AAT GCC TCA 655 
Pro Ala Val Phe Val Gln Ser Thr His Ser Gln Gly Asn Asn Ala Ser
150 155 160 165 

GAA GCC TGC TTT GAA AAT TTT CCA GAA GCC ACA TGG AAA ACA TAT CTC 703 
Glu Ala Cys Phe Glu Asn Phe Pro Glu Ala Thr Trp Lys Thr Tyr Leu 

170 175 180 

TCA AGG ATT GTA ATT TTC ATC GAA ATA GTG GGA TTT TTT ATT CCT CTA 751 
Ser Arg Ile Val Ile Phe Ile Glu Ile Val Gly Phe Phe Ile Pro Leu

185 190 195 

ATT TTA AAT GTA ACT TGT TCT AGT ATG GTG CTA AAA ACT TTA ACC AAA 799
Ile Leu Asn Val Thr Cys Ser Ser Met Val Leu Lys Thr Leu Thr Lys
200 205 210 

CCT GTT ACA TTA AGT AGA AGC AAA ATA AAC AAA ACT AAG GTT TTA AAA 847
Pro Val Thr Leu Ser Arg Ser Lys Ile Asn Lys Thr Lys Val Leu Lys
215 220 225

ATG ATT TTT GTA CAT TTG ATC ATA TTC TGT TTC TGT TTT GTT CCT TAC 895 
Met Ile Phe Val His Leu Ile Ile Phe Cys Phe Cys Phe Val Pro Tyr
230 235 240 245 

AAT ATC AAT CTT ATT TTA TAT TCT CTT GTG AGA ACA CAA ACA TTT GTT 943
Asn Ile Asn Leu Ile Leu Tyr Ser Leu Val Arg Thr Gln Thr Phe Val

250 255 260 

AAT TGC TCA GTA GTG GCA GCA GTA AGG ACA ATG TAC CCA ATC ACT CTC 991 
Asn Cys Ser Val Val Ala Ala Val Arg Thr Met Tyr Pro Ile Thr Leu

265 270 275 

TGT ATT GCT GTT TCC AAC TGT TGT TTT GAC CCT ATA GTT TAC TAC TTT 1039
Cys Ile Ala Val Ser Asn Cys Cys Phe Asp Pro Ile Val Tyr Tyr Phe
280 285 290 

ACA TCG GAC ACA ATT CAG AAT TCA ATA AAA ATG AAA AAC TGG TCT GTC 1087 
Thr Ser Asp Thr Ile Gln Asn Ser Ile Lys Met Lys Asn Trp Ser Val
295 300 305 

AGG AGA AGT GAC TTC AGA TTC TCT GAA GTT CAT GGT GCA GAG AAT TTT 1135

Arg Arg Ser Asp Phe Arg Phe Ser Glu Val His Gly Ala Glu Asn Phe 
310 315 320 325 

ATT CAG CAT AAC CTA CAG ACC TTA AAA AGT AAG ATA TTT GAC AAT GAA 1183 
Ile Gln His Asn Leu Gln Thr Leu Lys Ser Lys Ile Phe Asp Asn Glu

330 335 340 

TCT GCT GCC TGA AATAAAACCA TTAGGACTCA CTGGGACAGA ACTTTCAAGT 1235
Ser Ala Ala 

TCCTTCAACT GTGAAAAGTG TCTTTTTGGA CAAACTATTT TTCCACCTCC AAAAGAAATT 1295 



AACACA 1301



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Val Ser Val Asn Ser Ser His Cys Phe Tyr Asn Asp Ser Phe Lys 
1 5 10 15 

Tyr Thr Leu Tyr Gly Cys Met Phe Ser Met Val Phe Val Leu Gly Leu 

20 25 30 

Ile Ser Asn Cys Val Ala Ile Tyr Ile Phe Ile Cys Val Leu Lys Val
35 40 45 

Arg Asn Glu Thr Thr Thr Tyr Met Ile Asn Leu Ala Met Ser Asp Leu
50 55 60

Leu Phe Val Phe Thr Leu Pro Phe Arg Ile Phe Tyr Phe Thr Thr Arg
65 70 75 80 

Asn Trp Pro Phe Gly Asp Leu Leu Cys Lys Ile Ser Val Met Leu Phe

85 90 95 

Tyr Thr Asn Met Tyr Gly Ser Ile Leu Phe Leu Thr Cys Ile Ser Val

100 105 110 

Asp Arg Phe Leu Ala Ile Val Tyr Pro Phe Lys Ser Lys Thr Leu Arg
115 120 125 

Thr Lys Arg Asn Ala Lys Ile Val Cys Thr Gly Val Trp Leu Thr Val
130 135 140 

Ile Gly Gly Ser Ala Pro Ala Val Phe Val Gln Ser Thr His Ser Gln
145 150 155 160 

Gly Asn Asn Ala Ser Glu Ala Cys Phe Glu Asn Phe Pro Glu Ala Thr 

165 170 175 

Trp Lys Thr Tyr Leu Ser Arg Ile Val Ile Phe Ile Glu Ile Val Gly

180 185 190 

Phe Phe Ile Pro Leu Ile Leu Asn Val Thr Cys Ser Ser Met Val Leu
195 200 205 

Lys Thr Leu Thr Lys Pro Val Thr Leu Ser Arg Ser Lys Ile Asn Lys
210 215 220 

Thr Lys Val Leu Lys Met Ile Phe Val His Leu Ile Ile Phe Cys Phe
225 230 235 240 

Cys Phe Val Pro Tyr Asn Ile Asn Leu Ile Leu Tyr Ser Leu Val Arg

245 250 255 

Thr Gln Thr Phe Val Asn Cys Ser Val Val Ala Ala Val Arg Thr Met

260 265 270

Tyr Pro Ile Thr Leu Cys Ile Ala Val Ser Asn Cys Cys Phe Asp Pro
275 280 285 

Ile Val Tyr Tyr Phe Thr Ser Asp Thr Ile Gln Asn Ser Ile Lys Met
290 295 300 

Lys Asn Trp Ser Val Arg Arg Ser Asp Phe Arg Phe Ser Glu Val His 
305 310 315 320 

Gly Ala Glu Asn Phe Ile Gln His Asn Leu Gln Thr Leu Lys Ser Lys

325 330 335 

Ile Phe Asp Asn Glu Ser Ala Ala

340 
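
The CDS locations and protein lengths stated in this listing can be cross-checked with simple arithmetic. The following is a minimal sketch and not part of the filed listing: the names `CDS_FEATURES`, `codon_count`, and `check_listing` are hypothetical, and it assumes each CDS range is 1-based, inclusive, and excludes the stop codon, which is what the stated numbers suggest.

```python
# CDS ranges and protein lengths as stated in this listing:
# (CDS start, CDS end, stated protein length in amino acids).
CDS_FEATURES = {
    "SEQ ID NO: 1": (116, 1003, 296),   # protein given as SEQ ID NO: 2
    "SEQ ID NO: 3": (884, 2062, 393),   # protein given as SEQ ID NO: 4
    "SEQ ID NO: 5": (62, 940, 293),     # protein given as SEQ ID NO: 6
    "SEQ ID NO: 7": (161, 1192, 344),   # protein given as SEQ ID NO: 8
}


def codon_count(start, end):
    """Number of codons in a 1-based, inclusive nucleotide range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


def check_listing(features=CDS_FEATURES):
    """Map each cDNA to True if its CDS length matches the stated protein length."""
    return {
        name: codon_count(start, end) == protein_length
        for name, (start, end, protein_length) in features.items()
    }


if __name__ == "__main__":
    for name, ok in check_listing().items():
        print(name, "consistent" if ok else "MISMATCH")
```

A mismatch here would point to a transcription error in either the LOCATION field or the LENGTH field of the corresponding protein entry.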



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: Oligonucleotide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GACTAAAGCT TAATGAGTAG TGAAATGGTG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
GAACTTCTAG ACCCTCAGGG TTGTAAATCA G
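
The 3' portions of the oligonucleotides of SEQ ID NO: 9 and SEQ ID NO: 10 appear to correspond to the two ends of the coding region of SEQ ID NO: 1. The sketch below is illustrative only and not part of the filed listing: the function names are hypothetical, and the template fragments are copied by hand from the SEQ ID NO: 1 entry with spaces removed. It reports how many bases at the 3' end of each oligonucleotide match the sense strand (SEQ ID NO: 9) or its reverse complement (SEQ ID NO: 10).

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Fragments of SEQ ID NO: 1 (sense strand), copied from the listing.
CDS_START = "GATGTATGAGTAGTGAAATGGTGAAAAAT"      # bases around the initiating ATG
CDS_END = "AAACCCCCTGATTTACAACCCTGAGGAATGTAGA"   # final codons, stop, and a few bases after

FORWARD_OLIGO = "GACTAAAGCTTAATGAGTAGTGAAATGGTG"   # SEQ ID NO: 9
REVERSE_OLIGO = "GAACTTCTAGACCCTCAGGGTTGTAAATCAG"  # SEQ ID NO: 10


def annealing_length(oligo, template, antisense=False):
    """Length of the longest stretch at the oligo's 3' end found in `template`.

    With antisense=True the oligo is reverse-complemented first, so its
    3' end corresponds to the start of the probe string.
    """
    probe = reverse_complement(oligo) if antisense else oligo
    for n in range(len(probe), 0, -1):
        piece = probe[:n] if antisense else probe[-n:]
        if piece in template:
            return n
    return 0


if __name__ == "__main__":
    print("SEQ ID NO: 9 :", annealing_length(FORWARD_OLIGO, CDS_START))
    print("SEQ ID NO: 10:", annealing_length(REVERSE_OLIGO, CDS_END, antisense=True))
```

The unmatched 5' portion of each oligonucleotide is the added, non-template sequence; the same check could be repeated for the other oligonucleotide pairs against their respective cDNAs.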

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GACTAAAGCT TAATGAGGCC CACATGGGCA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 32 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

GAACTTCTAG ACGAACTAGT GGATCCCCCC GG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GACTAAAGCT TAATGGCGTC TTTCTCTGCT 

(2) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: Oligonucleotide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GAACTTCTAG ACTTCACACA GTTGTACTAT 30

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GACTAAAGCT TAATGGTAAG CGTTAACAGC 30

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
GAACTTCTAG ACTTCAGGCA GCAGATTCAT T 31

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTCCAAGCTT GCCACCATGA GTAGTGAAAT GGTG 34 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 58 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAGGGTTGT AAATCAGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTCCAAGCTT GCCACCATGG TTGGTGGCAC CTGG 34

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 58 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAGTGGATC CCCCGTGC 58

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GTCCAAGCTT GCCACCATGA ACACCACAGT AATG 34 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 61 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAAGGGATC CATACAAATG 60 
T 61 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GTCCAAGCTT GCCACCATGG TAAGCGTTAA CAGC 34 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 61 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAGGCAGCA GATTCATTGT 60 
C 61




(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGGGATCCCT CCATGAGTAG TGAAATGGTG 30

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 29 BASE PAIRS

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CGGGATCCCG CTCAGGGTTG TAAATCAGG 29



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Phe Phe Leu Ser His Leu Ala Ile Val Asp Ile Ala Tyr Ala Cys Asn
1 5 10 15

Thr Val Pro Gln Met Leu Val Asn Leu Leu Asp Pro Val Lys Pro Ile
20 25 30

Ser Tyr Ala Gly Cys Met Thr Gln Thr Phe Leu Phe Leu Thr Phe Ala
35 40 45

Ile Thr Glu Cys Leu Leu Leu Val Val Met Ser Tyr Asp Arg Tyr Val
50 55 60

Ala Ile Cys His Pro Leu Arg Tyr Ser Ala Ile Met Ser Trp Arg Val
65 70 75 80

Cys Ser Thr Met Ala Val Thr Ser Trp Ile Ile Gly Val Leu Leu Ser
85 90 95

Leu Ile His Leu Val Leu Leu Leu Pro Leu Pro Phe Cys Val Ser Gln
100 105 110

Lys Val Asn His Phe Phe Cys Glu Ile Thr Ala Ile Leu Lys Leu Ala
115 120 125

Cys Ala Asp Thr His Leu Asn Glu Thr Met Val Leu Ala Gly Ala Val
130 135 140

Ser Val Leu Val Gly Pro Phe Ser Ser Ile Val Val Ser Tyr Ala Cys
145 150 155 160

Ile Leu Gly Ala Ile Leu Lys Ile Gln Ser Glu Glu Gly Gln Arg Lys
165 170 175

Ala Phe Ser Thr Cys Ser Ser His Leu Cys Val Val Gly Leu Phe Tyr
180 185 190

Gly Thr Ala Ile Val Met Tyr Val Gly Pro Arg His Gly Ser Pro Lys
195 200 205

Glu Gln Lys Lys Tyr Leu Leu Leu Phe His Ser Leu Phe Asn
210 215 220



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Met Gly Pro Thr Ser Val Pro Leu Val Lys Ala His Arg Ser Ser Val
1 5 10 15

Ser Asp Tyr Val Asn Tyr Asp Ile Ile Val Arg His Tyr Asn Tyr Thr
20 25 30

Gly Lys Leu Asn Ile Ser Ala Asp Lys Glu Asn Ser Ile Lys Leu Thr
35 40 45

Ser Val Val Phe Ile Leu Ile Cys Cys Phe Ile Ile Leu Glu Asn Ile
50 55 60

Phe Val Leu Leu Thr Ile Trp Lys Thr Lys Lys Phe His Arg Pro Met
65 70 75 80

Tyr Tyr Phe Ile Gly Asn Leu Ala Leu Ser Asp Leu Leu Ala Gly Val
85 90 95

Ala Tyr Thr Ala Asn Leu Leu Leu Ser Gly Ala Thr Thr Tyr Lys Leu
100 105 110

Thr Pro Ala Gln Trp Phe Leu Arg Glu Gly Ser Met Phe Val Ala Leu
115 120 125

Ser Ala Ser Val Phe Ser Leu Leu Ala Ile Ala Ile Glu Arg Tyr Ile
130 135 140

Thr Met Leu Lys Met Lys Leu His Asn Gly Ser Asn Asn Phe Arg Leu
145 150 155 160 

Phe Leu Leu Ile Ser Ala Cys Trp Val Ile Ser Leu Ile Leu Gly Gly
165 170 175

Leu Pro Ile Met Gly Trp Asn Cys Ile Ser Ala Leu Ser Ser Cys Ser
180 185 190

Thr Val Leu Pro Leu Tyr His Lys His Tyr Ile Leu Phe Cys Thr Thr
195 200 205

Val Phe Thr Leu Leu Leu Leu Ser Ile Val Ile Leu Tyr Cys Arg Ile
210 215 220

Tyr Ser Leu Val Arg Thr Arg Ser Arg Arg Leu Thr Phe Arg Lys Asn
225 230 235 240

Ile Ser Lys Ala Ser Arg Ser Ser Glu Asn Val Ala Leu Leu Lys Thr
245 250 255

Val Ile Ile Val Leu Ser Val Phe Ile Ala Cys Trp Ala Pro Leu Phe
260 265 270

Ile Leu Leu Leu Leu Asp Val Gly Cys Lys Val Lys Thr Cys Asp Ile
275 280 285

Leu Phe Arg Ala Glu Tyr Phe Leu Val Leu Ala Val Leu Asn Ser Gly
290 295 300

Thr Asn Pro Ile Ile Tyr Thr Leu Thr Asn Lys Glu Met Arg Arg Ala
305 310 315 320

Phe Ile Arg Ile Met Ser Cys Cys Lys Cys Pro Ser Gly Asp Ser Ala
325 330 335

Gly Lys Phe Lys Arg Pro Ile Ile Ala Gly Met Glu Phe Ser Arg Ser
340 345 350

Lys Ser Asp Asn Ser Ser His Pro Gln Lys Asp Glu Gly Asp Asn Pro
355 360 365

Glu Thr Ile Met Ser Ser Gly Asn Val Asn Ser Ser Ser
370 375 380

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Ile Asn Ser Thr Ser Thr Gln Pro Pro Asp Glu Ser Cys Ser Gln Asn
1 5 10 15

Leu Leu Ile Thr Gln Gln Ile Ile Pro Val Leu Tyr Cys Met Val Phe
20 25 30

Ile Ala Gly Ile Leu Leu Asn Gly Val Ser Gly Trp Ile Phe Phe Tyr
35 40 45

Val Pro Ser Ser Lys Ser Phe Ile Ile Tyr Leu Lys Asn Ile Val Ile
50 55 60

Ala Asp Phe Val Met Ser Leu Thr Phe Pro Phe Lys Ile Leu Gly Asp
65 70 75 80

Ser Gly Leu Gly Pro Trp Gln Leu Asn Val Phe Val Cys Arg Val Ser
85 90 95

Ala Val Leu Phe Tyr Val Asn Met Tyr Val Ser Ile Val Phe Phe Gly
100 105 110

Leu Ile Ser Phe Asp Arg Tyr Tyr Lys Ile Val Lys Pro Leu Trp Thr
115 120 125

Ser Phe Ile Gln Ser Val Ser Tyr Ser Lys Leu Leu Ser Val Ile Val
130 135 140

Trp Met Leu Met Leu Leu Leu Ala Val Pro Asn Ile Ile Leu Thr Asn
145 150 155 160

Gln Ser Val Arg Glu Val Thr Gln Ile Lys Cys Ile Glu Leu Lys Ser
165 170 175

Glu Leu Gly Arg Lys Trp His Lys Ala Ser Asn Tyr Ile Phe Val Ala
180 185 190

Ile Phe Trp Ile Val Phe Leu Leu Leu Ile Val Phe Tyr Thr Ala Ile
195 200 205

Thr Lys Lys Ile Phe Lys Ser His Leu Lys Ser Ser Arg Asn Ser Thr
210 215 220

Ser Val Lys Lys Lys Ser Ser Arg Asn Ile Phe Ser Ile Val Phe Val
225 230 235 240

Phe Phe Val Cys Phe Val Pro Tyr His Ile Ala Arg Ile Pro Tyr Thr
245 250 255

Lys Ser Gln Thr Glu Ala His Tyr Ser Cys Gln Ser Lys Glu Ile Leu
260 265 270

Arg Tyr Met Lys Glu Phe Thr Leu Leu Leu Ser Ala Ala Asn Val Cys
275 280 285

Leu Asp Pro Ile Ile Tyr Phe Phe Leu Cys Gln Pro Phe Arg Glu Ile
290 295 300

Leu Cys Lys Lys Leu His Ile Pro Leu Lys Ala Gln Asn Asp Leu Asp
305 310 315 320

Ile Ser Arg Ile Lys
325

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 302 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ser Ser Asn Cys Ser Thr Glu Asp Ser Phe Lys Tyr Thr Leu Tyr Gly
1 5 10 15

Cys Val Phe Ser Met Val Phe Val Leu Gly Leu Ile Ala Asn Cys Val
20 25 30

Ala Ile Tyr Ile Phe Thr Phe Thr Leu Lys Val Arg Asn Glu Thr Thr
35 40 45

Thr Tyr Met Leu Met Leu Ala Ile Ser Asp Leu Leu Phe Val Phe Thr
50 55 60

Leu Pro Phe Arg Ile Tyr Tyr Phe Val Val Arg Asn Trp Pro Phe Gly
65 70 75 80

Asp Val Leu Cys Lys Ile Ser Val Thr Leu Phe Tyr Thr Asn Met Tyr
85 90 95

Gly Ser Ile Leu Phe Leu Thr Cys Ile Ser Val Asp Arg Phe Leu Ala
100 105 110

Ile Val His Pro Phe Arg Ser Lys Thr Leu Arg Thr Lys Arg Asn Ala
115 120 125

Arg Ile Val Cys Val Ala Val Trp Ile Thr Val Leu Ala Gly Ser Thr
130 135 140

Pro Ala Ser Phe Phe Gln Ser Thr Asn Arg Gln Asn Asn Thr Glu Gln
145 150 155 160

Arg Thr Cys Phe Glu Asn Phe Pro Glu Ser Thr Trp Lys Thr Tyr Leu
165 170 175

Ser Arg Ile Val Ile Phe Ile Glu Ile Val Gly Phe Phe Ile Pro Leu
180 185 190

Ile Leu Asn Val Thr Cys Ser Thr Met Val Leu Arg Thr Leu Asn Lys
195 200 205

Pro Leu Thr Leu Ser Arg Asn Lys Leu Ser Lys Lys Lys Val Leu Lys
210 215 220

Met Ile Phe Val His Leu Val Ile Phe Cys Phe Cys Phe Val Pro Tyr
225 230 235 240

Asn Ile Thr Leu Ile Leu Tyr Ser Leu Met Arg Thr Gln Thr Trp Ile
245 250 255

Asn Cys Ser Val Val Thr Ala Val Arg Thr Met Tyr Pro Val Thr Leu
260 265 270

Cys Ile Ala Val Ser Asn Cys Cys Phe Asp Pro Ile Val Tyr Tyr Phe
275 280 285

Thr Ser Asp Thr Asn Ser Glu Leu Asp Lys Lys Gln Gln Val
290 295 300






